Abstract

In this paper we present a novel generic mapping between Graphical Games and Markov Random 
Fields so that pure Nash equilibria in the former can be found by statistical inference on the latter. 
Thus, the problem of deciding whether a graphical game has a pure Nash equilibrium, a well-known 
intractable problem, can be attacked by well-established algorithms such as Belief Propagation, Junction 
Trees, Markov Chain Monte Carlo and Simulated Annealing. Large classes of graphical games thus become
tractable, including all classes already known, as well as new classes such as the games with O(log n)
treewidth.

1 Introduction

In general, describing a game requires data exponential in the number of players. Indeed, a normal
form game of n players and s strategies available to each player needs $n s^n$ numbers to be described. This
is because it is assumed that every player directly depends on the strategy of every other player. This
exponential dependency of the description size on the number of players is obviously prohibitive for the
study of games with a large number of players. Furthermore, such exponential complexity may not be
necessary: for many game domains of interest, such as markets and the Internet, it can be argued that the 
welfare of a player depends directly on only a few other players. This observation allows for much more 
succinct representations of games which exploit the dependencies between players more explicitly than does 
the classical representation. 

One important class of succinct games is that of graphical games, which was suggested by Kearns et
al. [KLS01]. In a graphical game, we are given a graph with the players as nodes. It is postulated that a
player's utility depends on the strategy chosen by the player and by the player's neighbors in the graph.
Such games played on graphs of bounded degree can be represented by polynomially many (in n and s)
numbers. Graphical games are quite attractive as models of the interaction of agents across a large network
or market. There has been a host of positive complexity results for this kind of game. It has been shown, for
example, that correlated equilibria (a sophisticated equilibrium concept suggested by Aumann [Aum74])
can be computed in polynomial time for graphical games that are trees [KKLO03], which was later extended
to all graphical games [Pap05]. Moreover, it has been shown that, in some cases, even mixed Nash equilibria
can be computed efficiently [LKS01].

From the famous theorem by Nash [Nas51] it follows that both correlated and mixed Nash equilibria
are guaranteed to exist in every game. The same does not hold for pure Nash equilibria, the deterministic
counterparts of mixed Nash equilibria. Pure Nash equilibria are not guaranteed to exist and, in fact, deciding
whether a graphical game has a pure Nash equilibrium is an NP-complete problem. NP-completeness holds
even when restricted to the class of graphical games defined on bipartite graphs of degree bounded by 3 and
3 strategies available to each player (see e.g. [GGS03]). Therefore, a reasonable question is whether there
exist large classes of graphical games for which deciding the existence of pure Nash equilibria and computing
one or all pure Nash equilibria can be done efficiently. Moreover, is it possible to design algorithms that
perform well on general hard instances of the problem? These questions are the focus of the present paper.

Besides the economy of description, one of the motivations for the introduction of graphical games was 
the intuitive affinity between graphical games and graphical statistical models; indeed, several algorithms 
for graphical games (e.g., [LKS01], [OK02]) do have the flavor of algorithms for Bayes nets. However, a
direct connection between graphical games and graphical statistical models had not been made explicit,
certainly not in the context of pure Nash equilibria.

In this paper we present a mapping from any graphical game to a Markov Random Field (MRF) with 
the following properties: 

• Finding a Maximum-A-Posteriori configuration of the MRF answers the question of whether the 
graphical game has a pure Nash equilibrium. 

• The marginal probability distributions of the cliques of the MRF constitute a succinct description of 
all pure Nash equilibria of the graphical game. Note that there might be exponentially many pure 
Nash equilibria so we might not be able to compute all of them explicitly in input polynomial time. 

• Sampling the distribution of the MRF provides a randomized algorithm for testing whether a graphical 
game has a pure Nash equilibrium and for computing pure Nash equilibria. 

As a consequence, any statistical inference algorithm (from Belief Propagation to Markov Chain Monte
Carlo methods [MRR+53] and Simulated Annealing [KGV87], see sections 4 and 5) can be used to compute
pure Nash equilibria. Moreover, by combining this mapping with the Junction Tree Algorithm [LS90],
[JLO90] we show that for large classes of graphical games we can compute in polynomial time a succinct
description of the set of pure Nash equilibria (including, among new results, all previously known efficient
algorithms for pure Nash equilibria [GGS03]):

• Graphical Games of bounded treewidth 

• Graphical Games of bounded hypertree width* 

• Graphical Games of O(log n)-treewidth, bounded neighborhood size and bounded cardinality strategy
sets

We believe that the latter class of graphical games is of special interest as a plausible model of networked 
markets. In such games, bounded neighborhood size and bounded number of strategies is a realistic assumption
(and, to some extent, essential for concise representation of the game), but the number of players can
be very large, while the game graph has a rich cycle structure. In fact, our result for this class of graphical 
games is the first positive result that is not based on some assumption about the cycle structure of the graph. 
Moreover, given that NP-completeness of computing pure Nash equilibria holds even when restricted to the 
class of games defined on graphs of degree 3 and at most 3 strategies available to each player, our result is 
quite tight. For an alternative way to approach large games, exploiting the periodic structure of the graph, 
see [DP05]. 

The structure of the paper is as follows. In section 2 we give the basic definitions and in section 3 we
describe our reduction from Graphical Games to Markov Random Fields. In section 4 we describe how the
reduction can be combined with the Junction Tree algorithm to yield polynomial time pure Nash equilibria
algorithms for large classes of graphical games. Finally, in section 5 we suggest a deterministic algorithm
for the general case and we describe how randomized algorithms can be derived using Markov Chain Monte
Carlo methods and Simulated Annealing. We also suggest the use of Survey Propagation [BMZ02] for
solving hard instances of the problem.

*Associated with every graphical game is a hypergraph, as described in section 2.1; hypertree width is a measure of the degree
of cyclicity of hypergraphs [GLS02] and its formal definition is given in section A of the appendix.

2 Preliminaries 

2.1 Graphical Games 

In a game we have n players, 1, . . . , n. Each player p, $1 \le p \le n$, has a finite set of strategies or choices,
$S_p$, with $|S_p| \ge 2$, and a payoff function $u_p : \prod_{i=1}^{n} S_i \to \mathbb{N}$. The set $S = \prod_{i=1}^{n} S_i$ is called the set of strategy
profiles and we denote the set $\prod_{i \ne p} S_i$ by $S_{-p}$.

It is clear that, in order to specify a game with n players and s strategies each, we need $n s^n$ numbers, an
amount of information exponential in the number of players. However, players often interact with a limited 
number of other players, and this allows for much more succinct representations: 

Definition 2.1 A graphical game $\mathcal{G} = (G, \{S_p\}, \{u_p\})$ is defined by:

• An undirected graph G = (V, E), where V = {1, . . . , n} is the set of players.

• For every player $p \in V$:

- A non-empty finite set of strategies $S_p$

- A payoff function $u_p : \prod_{i \in \mathcal{N}(p)} S_i \to \mathbb{N}$ (where $\mathcal{N}(p) = \{p\} \cup \{v \in V \mid (p, v) \in E\}$)

We note that a different perspective from which we can see graphical games, which will be useful later,
is through the hypergraph they induce. We can imagine that every game $\mathcal{G}$ defines a hypergraph having
as nodes the players and, for every player p, one hyperedge containing p's neighborhood, $\mathcal{N}(p)$; suppose
that we remove duplicate hyperedges if two or more players have the same neighborhood. We denote this
hypergraph by $\mathcal{H}(\mathcal{G})$ and we define as primal graph of the game the primal graph† of $\mathcal{H}(\mathcal{G})$.
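The construction of the neighborhoods, the induced hypergraph and its primal graph can be sketched in a few lines. This is only an illustrative sketch: the function names and the example path graph are assumptions made here, and players are numbered from 0 for convenience.

```python
from itertools import combinations

def neighborhoods(n, edges):
    """Return N(p) = {p} united with the neighbors of p, for every player p."""
    nbrs = {p: {p} for p in range(n)}
    for u, v in edges:
        nbrs[u].add(v)
        nbrs[v].add(u)
    return {p: frozenset(s) for p, s in nbrs.items()}

def hyperedges(nbrs):
    """Hyperedges of the induced hypergraph: one per distinct neighborhood."""
    return set(nbrs.values())

def primal_graph(hedges):
    """Two nodes are adjacent iff some hyperedge contains both of them."""
    return {frozenset(pair) for h in hedges for pair in combinations(sorted(h), 2)}

# Hypothetical example: the path 0 - 1 - 2.
nbrs = neighborhoods(3, [(0, 1), (1, 2)])
hedges = hyperedges(nbrs)
primal = primal_graph(hedges)
```

In this example the primal graph contains the extra edge between players 0 and 2, because both belong to the neighborhood of player 1.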

2.2 Pure Nash Equilibrium - Best Response Function 

Consider a game with n players and strategy sets $S_1, \ldots, S_n$. For every strategy profile $s \in S$, we denote
by $s_p$ the strategy of player p in this strategy profile and by $s_{-p}$ the $(n-1)$-tuple of strategies of all players
but p. For every $s'_p \in S_p$ and $s_{-p} \in S_{-p}$ we denote by $(s_{-p}; s'_p)$ the strategy profile in which player p plays
$s'_p$ and all the other players play according to $s_{-p}$.

Definition 2.2 (Pure Nash Equilibrium)

A strategy profile s is a pure Nash equilibrium if for every player p and strategy $t_p \in S_p$ we have $u_p(s) \ge u_p(s_{-p}; t_p)$.

Intuitively, a strategy profile s is a pure Nash equilibrium if none of the players has a unilateral incentive
to deviate: the player cannot increase his/her payoff by deviating to another strategy if the other players
continue to play the strategies in $s_{-p}$.

Definition 2.3 (Best Response Function)

The Best Response Function of player p is a function $BR_{u_p} : S_{-p} \to 2^{S_p}$ defined by:

$$BR_{u_p}(s_{-p}) = \{ s_p \mid s_p \in S_p \text{ and } \forall s'_p \in S_p : u_p(s_{-p}; s_p) \ge u_p(s_{-p}; s'_p) \}$$

Intuitively, $BR_{u_p}(s_{-p})$ is the set of strategies in $S_p$ that maximize p's payoff if the other players play $s_{-p}$.

It is easy to see that we can define pure Nash equilibrium in terms of the best response functions of players.
Indeed, a strategy profile s is a pure Nash equilibrium if and only if, for every player p, $s_p \in BR_{u_p}(s_{-p})$.

†The primal graph $G' = (V', E')$ of a hypergraph $\mathcal{H} = (V, \mathcal{E})$ has $V' = V$ and two nodes $u_1, u_2 \in V$ are connected iff there
is a hyperedge $h \in \mathcal{E}$ such that $u_1, u_2 \in h$.
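The characterization of pure Nash equilibria through best responses translates directly into a brute-force check. The following is a minimal sketch under stated assumptions: the function names and the two toy games are hypothetical, payoffs are given as Python callables on full profiles, and the enumeration is exponential in the number of players.

```python
from itertools import product

def best_response(p, profile, strategies, payoff):
    """Strategies of player p that maximize u_p when the others play as in `profile`."""
    def value(t):
        deviated = profile[:p] + (t,) + profile[p + 1:]
        return payoff[p](deviated)
    best = max(value(t) for t in strategies[p])
    return {t for t in strategies[p] if value(t) == best}

def pure_nash_equilibria(strategies, payoff):
    """Enumerate all profiles and keep those where everyone plays a best response."""
    return [s for s in product(*strategies)
            if all(s[p] in best_response(p, s, strategies, payoff)
                   for p in range(len(strategies)))]

# Hypothetical 2-player games with strategies {0, 1}.
strategies = [(0, 1), (0, 1)]
coordination = [lambda s: int(s[0] == s[1]), lambda s: int(s[0] == s[1])]
matching_pennies = [lambda s: int(s[0] == s[1]), lambda s: int(s[0] != s[1])]
```

The coordination game has two pure Nash equilibria, while matching pennies has none, which illustrates that existence is not guaranteed.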

2.3 Markov Random Fields and Statistical Inference 

MRFs: We describe informally the notion of an undirected graphical model and we refer the reader to
[Lau96] for a more detailed description. An Undirected Graphical Model, or Markov Random Field, over
an undirected graph G = (V, E), |V| = n, is a probability distribution that factorizes according to functions
defined on a set C of cliques of G‡. More precisely, associated with every node $v \in V$ is a random variable
$x_v$ taking values from a set $X_v$ of values§. Also, associated with every clique $c \in C$ is a potential function
$\psi_c : \prod_{v \in c} X_v \to \mathbb{R}_+$ that depends only on $x_c = \{x_v \mid v \in c\}$. Using this notation the probability distribution
defined on $x = \{x_v \mid v \in V\}$ is:

$$p(x) = \frac{1}{Z} \prod_{c \in C} \psi_c(x_c) \qquad (1)$$

where Z is a normalizing constant. We will refer to C as the set of significant cliques of the MRF.

The principal inference problems: Some of the principal statistical inference problems defined on Markov 
Random Fields are the following and the literature that addresses them is very rich. 

1. Maximum-A-Posteriori (MAP) Estimation: the problem of finding a configuration that is most likely
under the distribution p(x), or, more formally, of finding a configuration $x_{MAP} \in \arg\max_{x \in \prod_v X_v} p(x)$.

2. Computing the marginal probability distribution of a particular subset of the nodes or some subsets 
of the nodes simultaneously; usually the marginal probability distributions of the cliques of set C. 

3. Sampling the distribution p(x). 

A crucial observation about the normalizing constant Z: In order to compute Z one has to sum the product of the potentials over
all configurations $x \in X = \prod_{v \in V} X_v$, a computation that would require time exponential in the number
of nodes. However, this is not really needed for the above inference problems. Indeed, computing a
MAP configuration does not change whether we include Z in the computation or not. Also, sampling the 
distribution is usually based on the ratio of probabilities of configurations (e.g. in the Metropolis-Hastings 
sampling method) and so Z is cancelled. Finally, computing marginal probability distributions involves 
summing p(x); this can be done without including constant Z and the resulting function can be normalized 
after the completion of the algorithm, since the marginal distribution is a probability distribution and thus 
must be normalized; now, since the marginal is usually computed on a subset of few nodes, the time needed 
to normalize the computed function is much less than that required to find Z. Thus, a very common practice 
in statistical inference is to assume Z = 1 for all computations. This is a key assumption as will become clear
later (see discussion in section 3).
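The role of Z can be illustrated on a tiny MRF. This is a sketch only: the three-node chain and its potentials are made up for illustration, and both the MAP search and the marginal are computed by brute-force enumeration rather than by an efficient inference algorithm.

```python
from itertools import product

# Hypothetical MRF on nodes 0, 1, 2 with binary variables and two significant cliques.
domains = [(0, 1), (0, 1), (0, 1)]
potentials = {
    (0, 1): lambda a, b: 2.0 if a == b else 1.0,
    (1, 2): lambda b, c: 3.0 if b != c else 1.0,
}

def unnormalized(x):
    """Product of the clique potentials, i.e. p(x) computed as if Z = 1."""
    result = 1.0
    for clique, psi in potentials.items():
        result *= psi(*(x[v] for v in clique))
    return result

configs = list(product(*domains))
Z = sum(unnormalized(x) for x in configs)  # exponential-time sum, shown only for comparison

# The MAP configuration is the same with or without Z.
map_without_Z = max(configs, key=unnormalized)
map_with_Z = max(configs, key=lambda x: unnormalized(x) / Z)

def marginal(clique):
    """Sum the unnormalized product, then normalize the small clique table at the end."""
    table = {}
    for x in configs:
        key = tuple(x[v] for v in clique)
        table[key] = table.get(key, 0.0) + unnormalized(x)
    total = sum(table.values())
    return {k: v / total for k, v in table.items()}
```

Normalizing the clique table only requires summing its own entries, which is far cheaper than computing Z over all configurations.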

2.4 Clique Trees, Treewidth and the Junction Tree Algorithm 

The Junction Tree Algorithm is one of the most celebrated algorithms for statistical inference and is used
both for computing marginal distributions and for computing maximum-a-posteriori configurations. It is
also at the core of numerous other algorithms for MRFs and Bayesian Networks. For a quick description of
the algorithm we refer the reader to [WJ03]; a detailed description can be found in [JLO90], [LS90]. Here we
describe briefly some ingredients of the algorithm. Before doing so, let us define the notion of a clique tree
of a graph, which is an equivalent way of defining a tree decomposition, as implied by lemma 2.6.

‡Usually C is chosen to be the set of maximal cliques of graph G, but in some cases it is a different set. The choice depends on
the underlying application.

§All Markov Random Fields that we consider in this paper will have finite sets $X_v$. We will assume that this is the case in the
rest of the paper.

Definition 2.4 A clique tree of a chordal graph G = (V, E) is a tree $T = (C, \mathcal{E})$, where C is a subset of the
set of all cliques of graph G, that has the following properties:

• $\forall$ clique c of G, $\exists c' \in C$ s.t. $c \subseteq c'$ (thus all maximal cliques of G are nodes of T)

• $\forall c_1, c_2 \in C$, $\forall c_3 \in C$ in the unique path between $c_1$ and $c_2$, $c_1 \cap c_2 \subseteq c_3$ (clique intersection property)

The width of T is $\max_{c \in C} \{|c|\}$.

Definition 2.5 A clique tree of a graph G = (V, E) is a clique tree of some triangulation of G.

Lemma 2.6 (e.g. [Klo94]) Every clique tree of a graph G is a tree decomposition and vice versa. Thus, the
treewidth of a graph G is equal to the minimum width of all clique trees of G minus 1.

Briefly, the Junction Tree Algorithm does the following. It starts from the graph G = (V, E) of the MRF 
and triangulates it to get a chordal graph G' = (V, E'). Then it builds a clique tree $T = (C, \mathcal{E})$ of G' and it
loads to every node of T a potential function as follows: it assigns the potential function of every significant 
clique of the MRF to exactly one of the nodes of T that contain it and then for every node of T it takes the 
product of the potential functions that were assigned to it; if a node of T has no significant cliques assigned 
to it then its potential function is taken identically equal to one. After doing so, it performs calculations on 
T and computes, using a message passing algorithm, the marginal probability distributions of every clique
$c \in C$. The marginal distributions of the significant cliques of the MRF are derived from the marginal distributions
of the cliques of set C by summations. Also, note that by tweaking the algorithm a little it can be
used for computing Maximum-A-Posteriori configurations (see for example discussion in [WJ03]). 

The single non-standard step of the algorithm is the triangulation of G. Different triangulations lead to
different clique trees. However, the sizes of the cliques in the clique tree play a crucial role in the running
time of the algorithm, which is proportional to $\sum_{c \in C} \prod_{v \in c} |X_v|$. Note that finding the triangulation that
leads to the lightest clique tree under this objective function is an NP-hard optimization problem (see e.g.
[BG01] and the references therein) and there are various algorithms for building "good" clique trees. For an
overview of these algorithms see for example [BG01], [Bod05]. If $|X_v| = x$, $\forall v \in V$, then the running time
of the algorithm is $O(n \cdot x^{\mathrm{width}(T)})$, which is at least $\Omega(n \cdot x^{\mathrm{treewidth}(G)+1})$.
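The objective that governs the running time can be evaluated directly for a given clique tree. The sketch below assumes a hypothetical triangulation of a 4-cycle and uniform state spaces of size 3; the function names are illustrative only.

```python
from math import prod

def clique_tree_cost(cliques, domain_size):
    """Sum over cliques of the product of the state-space sizes of their nodes."""
    return sum(prod(domain_size[v] for v in c) for c in cliques)

def width(cliques):
    """Width of a clique tree as defined above: the size of its largest clique."""
    return max(len(c) for c in cliques)

# Hypothetical example: the 4-cycle 0-1-2-3-0 triangulated with the chord (0, 2).
cliques = [(0, 1, 2), (0, 2, 3)]
domain_size = {0: 3, 1: 3, 2: 3, 3: 3}
```

Here each clique stores a table of 27 entries, so the total cost is 54, and the width 3 equals the treewidth of the cycle plus one.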

3 The Reduction 

The reduction that we suggest can be roughly decomposed into two parts, that of translating the graphical 
game to an appropriately defined Markov Random Field and that of solving a statistical inference problem 
on the latter. The reduction is the following: 

Frontend: The Markov Random Field that corresponds to a graphical game $\mathcal{G} = (G = (V, E), \{S_p\}, \{u_p\})$
is defined as follows:

1. The underlying graph of the Markov Random Field is the primal graph $G' = (V', E')$ of $\mathcal{H}(\mathcal{G})$. Since
$V' = V$, every node $p \in V'$ corresponds to a unique player in V and we will identify the two.

2. Associated with every node $p \in V'$ is a random variable with state space $S_p$, i.e. a random variable
with state space equal to the strategy set of the corresponding player. So $X_p = S_p$.

3. For every player $p \in V$, its neighborhood, $\mathcal{N}(p)$, is a clique $c_p$ in $G'$. Note that there might be
two players $p_1 \ne p_2$ with $c_{p_1} = c_{p_2}$. The set of significant cliques of the MRF will be the set

$$C = \bigcup_{p \in V} \{c_p\}.$$

Obviously, $|C| \le |V|$.

4. We assign to every player p a function $f_p : \prod_{p' \in c_p} X_{p'} \to \mathbb{R}_+$ that is defined as follows (call $x(c_p)$
the vector of the random variables corresponding to the players of set $c_p$):

$$f_p(x(c_p)) = \begin{cases} 1, & \text{if } x(c_p)_p \in BR_{u_p}(x(c_p)_{-p}) \\ \epsilon, & \text{otherwise} \end{cases}$$

where $\epsilon < 1$ is a small constant to be decided later. Intuitively, the function $f_p(x(c_p))$ maps a
selection of strategies $x(c_p)$ for the players of the set $\mathcal{N}(p)$ to 1 if the strategy $x(c_p)_p$ of player p is a
best response to the strategies $x(c_p)_{-p}$ of his neighbors and to a small constant $\epsilon$ otherwise.

5. We assign to every clique $c \in C$ a potential function $\psi_c : \prod_{p \in c} X_p \to \mathbb{R}_+$ defined as follows:

$$\psi_c(x(c)) = \prod_{p \in V' : c_p = c} f_p(x(c)).$$

6. Since the Markov Random Field that we defined is parameterized on the choice of $\epsilon$ we will refer to it
using the notation MRF($\mathcal{G}$, $\epsilon$). Also, we will refer to the unnormalized (Z = 1) probability distribution
defined on MRF($\mathcal{G}$, $\epsilon$) as $p_\epsilon(x)$; i.e. $p_\epsilon(x) = \prod_{c \in C} \psi_c(x(c))$.

Backend: The computational problems related to pure Nash equilibria are now mapped to statistical inference
problems of Markov Random Fields as follows:

a. For any $\epsilon < 1$, finding a MAP configuration, $x_{MAP}$, of MRF($\mathcal{G}$, $\epsilon$) answers the question of whether
the graphical game $\mathcal{G}$ has a pure Nash equilibrium. This fact is stated by the following lemmas which
are proven in the appendix (call $X = \prod_v X_v$).

Lemma 3.1 $\forall \epsilon < 1$: ($\mathcal{G}$ has a pure Nash equilibrium) $\Leftrightarrow$ ($\max_{x \in X} \{p_\epsilon(x)\} = 1$)

Lemma 3.2 $\forall \epsilon < 1$: If $p_\epsilon(x) = 1$ for some x, then $x \in \arg\max_{x \in X} \{p_\epsilon(x)\}$ and x is a pure Nash
equilibrium of $\mathcal{G}$.

b. Computing the unnormalized marginal distributions¶ of the significant cliques of MRF($\mathcal{G}$, 0) answers
the question of whether the graphical game has a pure Nash equilibrium and, at the same time, the
unnormalized marginal distributions constitute a succinct description of all pure Nash equilibria of
the graphical game‖. These properties of the unnormalized marginal distributions of the significant
cliques of the Markov Random Field are stated by the following lemmas which are proven in the
appendix.

Lemma 3.3 ($\mathcal{G}$ has a pure Nash equilibrium) $\Leftrightarrow$ ($\forall$ clique c of MRF($\mathcal{G}$, 0), $\exists x^*(c)$ s.t. $p_{0,c}(x^*(c)) \ne 0$),
where $p_{0,c}(x(c))$ is the unnormalized marginal probability distribution of clique c.

Lemma 3.4 If $p_{0,c}(x^*(c)) \ne 0$ for some clique c of MRF($\mathcal{G}$, 0) and some $x^*(c)$ then $\exists$ a pure Nash
equilibrium $x^+$ of $\mathcal{G}$ such that $x^+(c) = x^*(c)$.

Based on lemmas 3.3 and 3.4 one can build a dynamic programming algorithm that takes as input the
marginal probability distributions of all the significant cliques of the MRF and outputs all pure Nash
equilibria in output polynomial time.

¶Those computed assuming Z = 1 as discussed in section 2.3.

‖Note that there can be exponentially many pure Nash equilibria. A succinct description of the set of all pure Nash equilibria
(or any other object) x is a string y such that |y| is polynomial in the description of the game and x = f(y) for some function f
computable in time polynomial in |x| + |y|.
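The frontend and part (a) of the backend can be sketched as follows. This is a minimal illustration under stated assumptions: all function names and the two toy games are hypothetical, local payoffs receive a dictionary from neighbors to strategies, players are numbered 0, ..., n − 1, and the MAP configuration is found by brute-force enumeration rather than by an inference algorithm.

```python
from itertools import product

def f(p, x, nbrs, strategies, payoff, eps):
    """f_p: 1 if x[p] is a best response to the neighbors' strategies in x, eps otherwise."""
    def u(t):
        local = {q: (t if q == p else x[q]) for q in nbrs[p]}
        return payoff[p](local)
    return 1.0 if u(x[p]) == max(u(t) for t in strategies[p]) else eps

def significant_cliques(nbrs):
    """Map each distinct neighborhood c to the players p with c_p = c."""
    cliques = {}
    for p, c in nbrs.items():
        cliques.setdefault(frozenset(c), []).append(p)
    return cliques

def p_eps(x, nbrs, strategies, payoff, eps):
    """Unnormalized (Z = 1) distribution: product over cliques of psi_c."""
    value = 1.0
    for players in significant_cliques(nbrs).values():
        for p in players:  # psi_c(x(c)) = product of f_p over players with c_p = c
            value *= f(p, x, nbrs, strategies, payoff, eps)
    return value

def map_configuration(nbrs, strategies, payoff, eps=0.5):
    """Brute-force MAP over all profiles (exponential; for illustration only)."""
    configs = product(*(strategies[p] for p in sorted(nbrs)))
    best = max(configs, key=lambda x: p_eps(x, nbrs, strategies, payoff, eps))
    return best, p_eps(best, nbrs, strategies, payoff, eps)

# Hypothetical games on the path 0 - 1 - 2 with strategies {0, 1}.
nbrs = {0: {0, 1}, 1: {0, 1, 2}, 2: {1, 2}}
strategies = {p: (0, 1) for p in nbrs}
matching = {0: lambda s: int(s[0] == s[1]),
            1: lambda s: int(s[1] == s[0]),
            2: lambda s: int(s[2] == s[1])}
pennies = {0: lambda s: int(s[0] == s[1]),
           1: lambda s: int(s[1] != s[0]),
           2: lambda s: 0}
```

In line with lemmas 3.1 and 3.2, the MAP value is 1 for the first game (and the MAP configuration is a pure Nash equilibrium), while for the second game the maximum stays below 1.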

Discussion: We can make the following observations: 

1. The reduction described above is completely generic and translates the problem of computing pure
Nash equilibria of graphical games to performing statistical inference calculations on Markov Random
Fields. The virtue of the reduction is that one can use all the machinery of algorithms
developed for statistical inference to attack the problem of computing pure Nash equilibria; any statistical
inference algorithm that computes marginal distributions, maximum-a-posteriori configurations
or samples distributions defined on Markov Random Fields works (for now in a heuristic sense) for
finding pure Nash equilibria. In sections 4 and 5 we combine this reduction with different statistical
inference algorithms and we derive exact deterministic algorithms as well as randomized and heuristic
methods for computing pure Nash equilibria.

2. From the proofs of lemmas 3.3 and 3.4 it follows that if a game $\mathcal{G}$ does not have a pure Nash equilibrium
then in MRF($\mathcal{G}$, 0) the function $p_0(x)$ is identically zero. In this case, of course, $p_0(x)$ cannot be a
probability distribution of a Markov Random Field. However, this does not affect our use of statistical
inference, because statistical inference algorithms are designed to handle this possibility**. For example,
the algorithms that compute marginal distributions do only summations over the unnormalized
probability distribution of the MRF and at a final stage normalize the computed functions if they are
not identically zero. Thus, the computations are not affected even if $p_0(x)$ is identically zero.

4 Efficient Algorithms For Computing Pure Nash Equilibria 

In this section we use our generic reduction to derive polynomial time algorithms for checking the existence
of pure Nash equilibria and for computing a succinct description of all pure Nash equilibria for large
classes of graphical games. The Markov Random Fields to which we will reduce our games will be those
with parameter $\epsilon = 0$ and the statistical inference algorithm on which we will base our derivations is the
Junction Tree Algorithm described in section 2.4. We note that the algorithms we derive are essentially
combinatorial, as the following simple observation shows. Although the Junction Tree Algorithm
performs arithmetic calculations on the potential functions, the only information we really need from the
values computed by the Junction Tree algorithm is whether they are zero or positive. Since we start from
non-negative entries in our potential functions (the entries are either zeros or ones) and since the Junction
Tree algorithm performs no subtractions, one could change the Junction Tree Algorithm arithmetic to account
only for whether an entry it computes along the execution is zero or positive. Doing so we need only
1 bit for every stored entry and there are no arithmetic precision issues that we have to address. Thus, our
algorithms are essentially combinatorial and our claim that the derived algorithms are polynomial time is
valid.
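The switch from numeric to one-bit arithmetic can be illustrated by computing the same unnormalized marginal twice, once with (+, ×) and once with (or, and). This sketch uses brute-force enumeration instead of the Junction Tree Algorithm, and the 0/1 tables are hypothetical; it only demonstrates that the two arithmetics agree on which entries are non-zero.

```python
from itertools import product

def numeric_marginal(tables, domains, clique):
    """Unnormalized marginal of `clique`: sum over configurations of the product of 0/1 potentials."""
    out = {}
    for x in product(*domains):
        value = 1
        for scope, table in tables.items():
            value *= table[tuple(x[v] for v in scope)]
        key = tuple(x[v] for v in clique)
        out[key] = out.get(key, 0) + value
    return out

def boolean_marginal(tables, domains, clique):
    """Same computation with (+, *) replaced by (or, and): one bit per entry."""
    out = {}
    for x in product(*domains):
        value = all(table[tuple(x[v] for v in scope)] for scope, table in tables.items())
        key = tuple(x[v] for v in clique)
        out[key] = out.get(key, False) or value
    return out

# Hypothetical 0/1 potentials on three binary variables.
domains = [(0, 1), (0, 1), (0, 1)]
tables = {
    (0, 1): {(0, 0): 1, (0, 1): 0, (1, 0): 0, (1, 1): 1},
    (1, 2): {(0, 0): 1, (0, 1): 1, (1, 0): 0, (1, 1): 0},
}
numeric = numeric_marginal(tables, domains, (0, 1))
boolean = boolean_marginal(tables, domains, (0, 1))
```

Because no subtraction ever occurs, an entry of the numeric table is positive exactly when the corresponding boolean entry is true.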

4.1 Graphical Games on Trees and Acyclic Hypergraph Games 

We will show that graphical games defined on trees and graphical games with acyclic hypergraphs (see
[BFMY83] for a definition of hypergraph acyclicity) have efficient pure Nash equilibria computation schemes.
Note that the class of games defined on trees is a subclass of the class of games with acyclic hypergraphs as
stated by the following easy lemma.

**Since MRFs are defined in a distributed fashion, i.e. by potential functions on subsets of the nodes, this is an intrinsic problem
in statistical inference.

Lemma 4.1 Let $\mathcal{G} = (G, \{S_p\}, \{u_p\})$ be a graphical game and let G be a tree. Then $\mathcal{H}(\mathcal{G})$ is acyclic.

So it is enough to prove our claim for the class of graphical games with acyclic hypergraphs. Before doing so
we define the notion of a join tree for a hypergraph and we present a theorem relating hypergraph acyclicity,
Graham's Algorithm and Join Trees. For a description of Graham's Algorithm we refer to [BFMY83].

Definition 4.2 A join tree for a hypergraph $(V, \mathcal{H})$ is a tree $T = (\mathcal{V}, E)$, where $\mathcal{V} = \mathcal{H}$, so that, for all
$v_1, v_2 \in \mathcal{V}$ and for all $u \in \mathcal{V}$ in the unique path between $v_1$ and $v_2$, $v_1 \cap v_2 \subseteq u$.

Theorem 4.3 [BFMY83] If $\mathcal{R} = (V, H)$ is a hypergraph then:

($\mathcal{R}$ is acyclic) $\Leftrightarrow$ ($\mathcal{R}$ has a join tree) $\Leftrightarrow$ (Graham's algorithm succeeds on input $\mathcal{R}$)

We can now prove our claim: 

Theorem 4.4 Deciding whether a graphical game has a pure Nash equilibrium is in P for all games with 
acyclic hypergraph. Moreover, computing a succinct description of all pure Nash equilibria can be done in 
polynomial time. 

Proof: On input $\mathcal{G} = (G = (V, E), \{S_p\}, \{u_p\})$, the algorithm proceeds in the following steps:

1. Apply Graham's Algorithm on $\mathcal{H}(\mathcal{G})$ to check whether it is acyclic; if so, Graham's algorithm returns
a join tree $T = (C, \mathcal{E})$ for $\mathcal{H}(\mathcal{G})$ (for details see [BFMY83]). It is easy to see that T is a clique tree
for the primal graph G' of the game.

2. Reduce $\mathcal{G}$ to MRF($\mathcal{G}$, 0) as described in section 3. The graph of the MRF is the primal graph of the
game and so T is a clique tree for the graph of MRF($\mathcal{G}$, 0).

3. Run the Junction Tree Algorithm on T to compute the marginal probability distributions of the significant
cliques of MRF($\mathcal{G}$, 0).

4. The marginal probability distributions answer the question of whether $\mathcal{G}$ has a pure Nash equilibrium
and also constitute a succinct description of all pure Nash equilibria of $\mathcal{G}$ (see section 3).
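The steps above can be sketched for the existence question alone, using the one-bit arithmetic discussed at the start of this section. The sketch is not the full Junction Tree Algorithm: it assumes the join tree is already given, stores for each clique only the set of local assignments with non-zero potential, and performs just the leaves-to-root pass, which suffices to decide existence. The names, the tree and the row sets are hypothetical.

```python
def semijoin(parent_vars, parent_rows, child_vars, child_rows):
    """Keep the parent rows that agree with at least one child row on the shared variables."""
    shared = [v for v in parent_vars if v in child_vars]
    p_idx = [parent_vars.index(v) for v in shared]
    c_idx = [child_vars.index(v) for v in shared]
    seen = {tuple(row[i] for i in c_idx) for row in child_rows}
    return {row for row in parent_rows if tuple(row[i] for i in p_idx) in seen}

def upward_pass(node, scopes, rows, children):
    """Rows of `node` that extend to a consistent assignment of its whole subtree."""
    current = set(rows[node])
    for child in children.get(node, []):
        child_rows = upward_pass(child, scopes, rows, children)
        current = semijoin(scopes[node], current, scopes[child], child_rows)
    return current

def has_pure_nash(root, scopes, rows, children):
    """A pure Nash equilibrium exists iff some row survives at the root."""
    return len(upward_pass(root, scopes, rows, children)) > 0

# Hypothetical join tree for the path 0 - 1 - 2: cliques N(0), N(1), N(2), rooted at N(1).
scopes = {"c0": (0, 1), "c1": (0, 1, 2), "c2": (1, 2)}
children = {"c1": ["c0", "c2"]}

# Rows where the clique's player best-responds. Game A: every player wants to match
# (player 1 matches player 0). Game B: player 1 wants to differ from player 0 instead.
matching_rows = {"c0": {(0, 0), (1, 1)},
                 "c1": {(0, 0, 0), (0, 0, 1), (1, 1, 0), (1, 1, 1)},
                 "c2": {(0, 0), (1, 1)}}
pennies_rows = {"c0": {(0, 0), (1, 1)},
                "c1": {(0, 1, 0), (0, 1, 1), (1, 0, 0), (1, 0, 1)},
                "c2": {(0, 0), (0, 1), (1, 0), (1, 1)}}
```

Each semijoin touches only the tables at the two endpoints of a tree edge, which mirrors the per-message cost bound used in the time complexity argument.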

Correctness: The correctness of the algorithm follows from the correctness of the reduction and the correctness
of Graham's algorithm and the Junction Tree algorithm. Note, also, that since the Junction Tree
algorithm maintains supportiveness [LS90] it won't be affected by any "divisions by zero".
Time Complexity: Graham's algorithm finds the join tree of $\mathcal{H}(\mathcal{G})$ in time polynomial in the size of the
hypergraph and, thus, in the number of players. Reducing $\mathcal{G}$ to MRF($\mathcal{G}$, 0) also takes polynomial time. It
remains to bound the running time of the message-passing phase of the junction tree algorithm. This phase is 
executed on the clique tree, which is precisely the join tree that Graham's algorithm returned, and it involves 
the exchange of as many messages as twice the number of edges of the clique tree, so at most $2n - 2$
messages, where n is the number of players (note that the number of nodes of the clique tree is at most equal 
to the number of players). Now, the time needed to compute a message that is being sent over an edge of the 
clique tree is polynomial in the size of the tables (potential functions) that are stored at the endpoints of that 
edge. However, the clique in every node of the clique tree corresponds to the neighborhood of a player and 
so its table has the same size as the table describing the utility function of that player. Thus, the complexity 
of every message is polynomial in the input complexity. It is, thus, obvious that the described algorithm 
runs in time polynomial in the description of the graphical game. ■

4.2 Games of Bounded Treewidth and Games of Bounded Hypertree-Width

It is easy to extend the results of section 4.1 to broader classes of graphical games, those of bounded
treewidth and bounded hypertree width respectively. The hypertree width of a graphical game $\mathcal{G}$ is the
hypertree width of $\mathcal{H}(\mathcal{G})$. For a formal definition of the latter we refer the reader to [GLS02], but for quick
reference we provide a definition in section A of the appendix. Our results are stated by theorems 4.6 and
4.7. The latter was also proven independently in [GGS03] using different techniques. All proofs of this
section are postponed to the appendix. The proof of theorem 4.6 uses lemma 4.5.

Lemma 4.5 If the graph G = (V, E) of a graphical game $\mathcal{G}$ has treewidth bounded by k then its primal
graph has treewidth bounded by $(k + 1) \cdot \max_{p \in V} |\mathcal{N}(p)| - 1$. Moreover, given a tree decomposition of G
of width k we can compute in polynomial time a clique tree for the primal graph of $\mathcal{G}$ of width
$(k + 1) \cdot \max_{p \in V} |\mathcal{N}(p)|$.

Theorem 4.6 Deciding whether a graphical game has a pure Nash equilibrium and computing (a succinct
description of) all pure Nash equilibria is in P for all classes of games with bounded treewidth.

Theorem 4.7 Deciding whether a graphical game has a pure Nash equilibrium and computing (a succinct
description of) all pure Nash equilibria is in P for all classes of games with bounded hypertree width.

4.3 Games of O(log n)-Treewidth

The classes of graphical games for which we have derived efficient pure Nash equilibria computation
schemes are quite broad. However, often in games of practical interest the treewidth and hypertree width of
the underlying graph are not bounded. In this section we go one step further in our study of graphical games
and study classes of games with O(log n) treewidth. For these classes, we derive polynomial time pure
Nash equilibria computation algorithms under the assumption of bounded neighborhood size and bounded
cardinality strategy sets. Given that NP-completeness of computing pure Nash equilibria holds even when
restricted to the class of graphical games with bipartite graphs of degree at most 3 and strategy sets of cardinality
at most 3, our result is quite tight. Moreover, it is the first positive result for computing pure Nash
equilibria that is not based on some assumption about the cycle structure of the graph. For a different setting
for studying big games where succinct description is required see [DP05]. Also, see theorem 4.9 for a
relaxation of the bounded neighborhood requirement.

Theorem 4.8 Deciding whether a graphical game has a pure Nash equilibrium is in P for all classes of
games with O(log n) treewidth, bounded cardinality strategy sets and bounded neighborhood size. Moreover,
computing a succinct description of all pure Nash equilibria can be done in polynomial time.
Proof: The algorithm is similar in spirit to the ones presented so far, but differs in the construction of the 
clique tree on which the Junction Tree algorithm performs. This task is somewhat involved and is described 
here. Suppose k = k(n); by slightly modifying the algorithm presented by Becker and Geiger [BG01], we
get an algorithm that runs in time $\mathrm{poly}(n) \cdot 2^{4.67k}$, on input a graph G of n nodes, and either outputs a tree decomposition
of G of width at most 3.67k or outputs that the treewidth of G is larger than k††. For k = c log n,
where c is a fixed constant, we get an algorithm that runs in time polynomial in n and either returns a tree 
decomposition of the input graph G of width at most 3.67c log n or outputs that the treewidth of G is larger 
than clog n. Now, suppose Q = (G, {S p }, {u p }} is a game drawn from a family of games with treewidth at 
most clog n. Applied to G, the algorithm returns a tree decomposition of G of width at most 3.67 • c ■ log n. 
Given this tree decomposition, from lemma l4~5l we can construct in polynomial time a clique tree for the 

^Alternatively we could use Reed's approximation algorithm |Ree92| for treewidth or other approximation algorithms. 



9 



primal graph of Q, which is the graph of the MRF, of width w = (3.67 • c • logn + 1) • max pe y \M(p) |. Now, 
if we assume bounded cardinality strategy sets, it follows that the sizes of the tables (potential functions) 
that will be stored in the clique tree before the execution of the junction tree algorithm and, thus, all the 
messages exchanged during the execution have size O ( n ( max I^WI)j. jf ; moreover, we assume bounded 
neighborhood size they are polynomial in the number of players. So all the computation takes polynomial 
time. ■ 
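
The size bound used at the end of the proof can be checked with one line of arithmetic. The block below is a worked sketch rather than part of the original argument; the shorthands $s = \max_p |S_p|$ and $m = \max_{p \in V} |\mathcal{N}(p)|$ are introduced here only for illustration, and a table over at most $w$ players is assumed to have at most $s^w$ entries.

```latex
s^{w} \;=\; s^{(3.67\,c\,\log n + 1)\,m}
      \;=\; s^{m} \cdot \bigl(s^{\log n}\bigr)^{3.67\,c\,m}
      \;=\; s^{m} \cdot n^{\,3.67\,c\,m\,\log s}
      \;=\; n^{O(m)} \qquad \text{for constant } s \text{ and } c,
```

using the identity $s^{\log n} = n^{\log s}$. When $m$ is bounded as well, the exponent is a constant and the table size is polynomial in n.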

We can drop the bounded neighborhood requirement by imposing the O(log n)-treewidth requirement on
the primal graph of the game, as stated by the following theorem, which is proven in the appendix. This way
we can in some cases accommodate neighborhoods of size up to O(log n), which might be helpful in some
applications.

Theorem 4.9 Deciding whether a graphical game has a pure Nash equilibrium is in P for all classes of
games with primal graphs of treewidth O(log n) and bounded cardinality strategy sets. Moreover, computing
a succinct description of all pure Nash equilibria can be done in polynomial time.
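
To see numerically why logarithmic treewidth of the primal graph keeps the tables polynomial, the following minimal sketch compares the number of entries of the largest table with its polynomial form. It is an illustration under stated assumptions: logarithms are taken base 2, the factor 3.67 is the one quoted from [BG01] in this paper, and the function names are hypothetical.

```python
import math

APPROX = 3.67  # approximation factor quoted from [BG01] in the text


def bag_size(n, c):
    """Largest bag of a decomposition of width 3.67 * c * log2(n)."""
    return APPROX * c * math.log2(n) + 1


def table_entries(n, s, c):
    """Entries of a table over one bag when every player has s strategies."""
    return s ** bag_size(n, c)


def polynomial_form(n, s, c):
    """The same quantity rewritten as s * n ** (3.67 * c * log2(s))."""
    return s * n ** (APPROX * c * math.log2(s))


def degree(s, c):
    """Exponent of n in the table size; constant when s and c are constant."""
    return APPROX * c * math.log2(s)
```

For example, with s = 2 and c = 1 the degree is 3.67, so the largest table has on the order of n^3.67 entries.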

5 Further 

5.1 A General Algorithmic Scheme 

In section 4 we combined our reduction with the junction tree algorithm and derived polynomial time
algorithms for computing pure Nash equilibria for large classes of graphical games. The high level scheme of
our algorithms is the following:

• Reduce input game Q to MRF(Q, 0).

• Find a good clique tree of the MRF graph. 

• Run the junction tree algorithm on the clique tree. 

In essence, our algorithms try to reconcile two things. On the one hand, the running time of the junction tree
algorithm crucially depends on the width of the clique tree compared to the size of the largest neighborhood
of the game. On the other hand, computing the optimal clique tree is an NP-hard problem. To circumvent this
difficulty we make the following observation: we do not really need to find the optimal clique tree. To
preserve efficiency, a constant factor approximation is enough: if the junction tree algorithm on the optimal
clique tree runs in polynomial time, then it runs in polynomial time on a constant factor approximation of the
optimal tree. Moreover, the running time of the junction tree algorithm will be polynomial in $n \cdot s^w$, where
s is the number of strategies and w is the width of the tree decomposition we achieve. So, an algorithm
for computing a constant factor approximation to the treewidth is sufficiently fast if its running time is
polynomial in $n \cdot s^w$, where w is the constant factor approximation to the treewidth that it achieves. In fact,
there are various approximation algorithms for treewidth that have this property. The one given in [BG01]
runs in time proportional to $\mathrm{poly}(n) \cdot 2^{4.67k}$, where k is the treewidth, and returns a clique tree of width
at most 3.67 · k; Reed's algorithm [Ree92], Robertson and Seymour's algorithm [RS95] and other
algorithms [Bod05] also have this property. Therefore, we have optimal algorithms under the above scheme.
Notably, all algorithms presented in section 4 can be derived as special cases of this scheme. Thus, our
general algorithm encompasses all the positive results for computing pure Nash equilibria known to date
plus the new positive results discovered in this paper.
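
The two claims about constant factor approximations can be spelled out as follows. This is a worked sketch, not taken from the original text: $w^*$ denotes the optimal width and $\alpha \ge 1$ the approximation factor, both symbols being introduced here for illustration, and we assume $n \ge 1$ and $s \ge 2$.

```latex
n \cdot s^{\alpha w^*} \;\le\; n^{\alpha} \cdot s^{\alpha w^*} \;=\; \bigl(n \cdot s^{w^*}\bigr)^{\alpha},
\qquad
2^{4.67\,k} \;\le\; s^{4.67\,k} \;=\; \bigl(s^{k}\bigr)^{4.67}.
```

The first inequality shows that running the junction tree algorithm on an $\alpha$-approximate tree costs at most a fixed power of its cost on the optimal tree; the second shows that the running time quoted for [BG01] is itself polynomial in $n \cdot s^{k}$.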

5.2 Heuristics 

The use of the junction tree algorithm provided the deterministic polynomial time algorithms presented in
the previous sections. However, our reduction permits the use of a large variety of statistical inference
algorithms to attack our problem. In this section we comment on two families of statistical inference algorithms
that we believe are promising for solving graphical games: Markov Chain Monte Carlo methods and Survey
Propagation.

5.2.1 Sampling and Simulated Annealing 

Our definition of MRF(Q, ε) ensures that for some small ε(Q) < 1 most of the probability mass is
concentrated on the set of pure Nash equilibria of Q, if equilibria exist. More generally, the probability of a
configuration decays exponentially in the number of unsatisfied players. This observation suggests quite
naturally the use of sampling techniques to find pure Nash equilibria of games or approximations to pure
Nash equilibria, i.e. configurations with as many satisfied players as possible. Markov Chain Monte Carlo
methods and Simulated Annealing (see e.g. [GRS96]) can be easily applied for computing pure Nash
equilibria under our reduction.
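
The sampling idea can be sketched in a few lines. What follows is a minimal Metropolis-style simulated annealing sketch, not the authors' implementation. It assumes that the weight of a configuration is ε raised to the number of unsatisfied players, which is how we read the exponential decay stated above. The function names, the geometric schedule for ε, the convention that each neighborhood contains the player itself, and the ring coordination game are all illustrative assumptions.

```python
import math
import random


def unsatisfied(x, strategies, neighborhood, utility):
    """Number of players not playing a best response in configuration x."""
    count = 0
    for p in neighborhood:
        local = {q: x[q] for q in neighborhood[p]}  # x restricted to N(p)
        if any(utility(p, {**local, p: s}) > utility(p, local)
               for s in strategies[p]):
            count += 1
    return count


def anneal(strategies, neighborhood, utility,
           eps_start=0.9, eps_end=0.01, steps=2000, seed=0):
    """Search for a configuration with few unsatisfied players."""
    rng = random.Random(seed)
    players = sorted(strategies)
    x = {p: rng.choice(strategies[p]) for p in players}
    energy = unsatisfied(x, strategies, neighborhood, utility)
    best, best_energy = dict(x), energy
    for t in range(steps):
        if best_energy == 0:  # every player is satisfied: pure Nash equilibrium
            break
        # Geometric schedule: eps shrinks from eps_start towards eps_end.
        eps = eps_start * (eps_end / eps_start) ** (t / max(steps - 1, 1))
        p = rng.choice(players)
        proposal = dict(x)
        proposal[p] = rng.choice(strategies[p])
        new_energy = unsatisfied(proposal, strategies, neighborhood, utility)
        # Metropolis rule for a target proportional to eps ** energy.
        if new_energy <= energy or rng.random() < eps ** (new_energy - energy):
            x, energy = proposal, new_energy
            if energy < best_energy:
                best, best_energy = dict(x), energy
    return best, best_energy


# Hypothetical example: coordination game on a ring of 6 players.
n = 6
strategies = {p: (0, 1) for p in range(n)}
neighborhood = {p: ((p - 1) % n, p, (p + 1) % n) for p in range(n)}


def utility(p, local):
    # Number of neighbours playing the same strategy as p.
    return sum(1 for q, s in local.items() if q != p and s == local[p])


best, best_energy = anneal(strategies, neighborhood, utility)
```

A returned `best_energy` of 0 certifies a pure Nash equilibrium; a positive value only reports the best approximation found within the step budget.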

5.2.2 Survey Propagation Algorithms for solving Hard Instances 

An algorithm motivated by statistical inference that appears to be very effective in solving random k-SAT
instances and other constraint satisfaction problems is survey propagation (e.g. [BMZ02]). Survey
propagation performs better than belief propagation for k-SAT instances near the SAT/UNSAT threshold (see
e.g. [AP03] for the latter). In [MMW05] survey propagation is extended to a family of survey propagation
algorithms (parameterized by a real number ρ ∈ [0, 1]) that has at one extreme (ρ = 0) belief propagation
and at the other extreme (ρ = 1) survey propagation. By reducing graphical games to Markov Random
Fields we can use all algorithms of this family to compute pure Nash equilibria. We believe that survey
propagation algorithms for pure Nash equilibria will be very useful for solving hard instances of graphical
games.

APPENDIX

A Hypertree Decompositions and Hypertree width 

For quick reference, we define here the notion of a hypertree decomposition and hypertree-width of a
hypergraph. For more details on the properties of hypertree-width and its relation to other notions of hypergraph
acyclicity we refer the reader to [GLS02].

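Definition A.1 ([GLS02]) Let $\mathcal{R} = (N, \mathcal{H})$ be a hypergraph. A hypertree decomposition of $\mathcal{R}$ is a triplet
$(T, \chi, \lambda)$, where $T = (V, E)$ is a rooted tree and $\chi, \lambda$ are labelling functions associating each vertex $v \in V$
with two sets $\chi(v) \subseteq N$ and $\lambda(v) \subseteq \mathcal{H}$, so that:

1. $\forall h \in \mathcal{H}, \exists v \in V : h \subseteq \chi(v)$

2. $\forall n \in N$, the set $\{v \in V \mid n \in \chi(v)\}$ induces a connected subgraph of $T$

3. $\forall v \in V$, $\chi(v) \subseteq \bigcup_{h \in \lambda(v)} h$

4. $\forall v \in V$, $\chi(T_v) \cap \bigcup_{h \in \lambda(v)} h \subseteq \chi(v)$

where $T_v$ is the subtree of $T$ rooted at $v$ and $\chi(T_v) = \bigcup_{u \in \mathrm{vert}(T_v)} \chi(u)$.
The width of $(T, \chi, \lambda)$ is $\max_{v \in V} |\lambda(v)|$.

The hypertree width of a hypergraph $\mathcal{R}$, $hw(\mathcal{R})$, is the minimum width over all its hypertree decompositions.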
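
The four conditions of the definition are easy to test mechanically on small instances. The following is a minimal checker sketch under stated assumptions: the tree is given as a `children` map with a designated root, hyperedges and labels are Python sets, and all names (`is_hypertree_decomposition`, `chi`, `lam`, and the toy hypergraph) are hypothetical and chosen only for illustration.

```python
def subtree(children, v):
    """All vertices of the subtree T_v rooted at v."""
    out, stack = [], [v]
    while stack:
        u = stack.pop()
        out.append(u)
        stack.extend(children.get(u, ()))
    return out


def is_hypertree_decomposition(nodes, hyperedges, children, root, chi, lam):
    verts = subtree(children, root)
    parent = {c: v for v in verts for c in children.get(v, ())}
    # Condition 1: every hyperedge is contained in some chi(v).
    if not all(any(edge <= chi[v] for v in verts)
               for edge in hyperedges.values()):
        return False
    # Condition 2: the vertices whose chi contains n are connected in T.
    # In a rooted tree, a vertex set is connected iff at most one of its
    # members has its parent outside the set.
    for n in nodes:
        holders = {v for v in verts if n in chi[v]}
        tops = [v for v in holders if parent.get(v) not in holders]
        if len(tops) > 1:
            return False
    for v in verts:
        covered = set().union(*(hyperedges[h] for h in lam[v]))
        below = set().union(*(chi[u] for u in subtree(children, v)))
        # Condition 3: chi(v) is covered by the hyperedges in lambda(v).
        if not chi[v] <= covered:
            return False
        # Condition 4: chi(T_v) intersected with that cover stays in chi(v).
        if not (below & covered) <= chi[v]:
            return False
    return True


def decomposition_width(lam):
    """Width: the largest number of hyperedges attached to one vertex."""
    return max(len(edges) for edges in lam.values())


# Hypothetical toy hypergraph with two hyperedges sharing node "c".
nodes = {"a", "b", "c", "d"}
hyperedges = {"e1": {"a", "b", "c"}, "e2": {"c", "d"}}
children = {"r": ["t"]}
chi = {"r": {"a", "b", "c"}, "t": {"c", "d"}}
lam = {"r": {"e1"}, "t": {"e2"}}
```

On this toy instance the decomposition has width 1; shrinking `chi["t"]` to `{"d"}` leaves hyperedge `e2` uncovered and violates condition 1.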

B Missing Proofs 

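Proof of lemma 3.1
(⇒) We have:

$x$ is a pure Nash equilibrium of $Q$
$\Rightarrow \forall p \in V : x(c_p)_p \in BR_{u_p}(x(c_p)_{-p})$
$\Rightarrow \forall p \in V' : f_p(x(c_p)) = 1$
$\Rightarrow \forall c \in C : \psi_c(x(c)) = 1$
$\Rightarrow p_\epsilon(x) = 1$
$\Rightarrow x \in \arg\max_{x \in X} p_\epsilon(x)$ (since by definition $p_\epsilon(x) \le 1, \forall x$)
$\Rightarrow \max_{x \in X} p_\epsilon(x) = 1$

(⇐) We have:

$Q$ has no pure Nash equilibria
$\Rightarrow \nexists x \in X$ s.t. $\forall p \in V : x(c_p)_p \in BR_{u_p}(x(c_p)_{-p})$
$\Rightarrow \nexists x \in X$ s.t. $\forall p \in V' : f_p(x(c_p)) = 1$
$\Rightarrow \nexists x \in X$ s.t. $\forall c \in C : \psi_c(x(c)) = 1$
$\Rightarrow \nexists x \in X$ s.t. $p_\epsilon(x) = 1$
$\Rightarrow \max_{x \in X} p_\epsilon(x) < 1$ (since by definition $p_\epsilon(x) \le 1, \forall x$)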
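
The equivalence proved above can be checked by brute force on tiny games. The sketch below is only an illustration and uses exhaustive enumeration in place of any inference algorithm. It assumes that the weight of a configuration is the product over players of a factor that is 1 when the player best-responds and ε otherwise, and that each neighborhood contains the player itself; the function names and the two example games are hypothetical.

```python
from itertools import product


def is_best_response(p, x, strategies, neighborhood, utility):
    """True if player p's strategy in x is a best response to its neighbors."""
    local = {q: x[q] for q in neighborhood[p]}  # x restricted to N(p)
    current = utility(p, local)
    return all(utility(p, {**local, p: s}) <= current for s in strategies[p])


def p_eps(x, eps, strategies, neighborhood, utility):
    """Unnormalised weight: product over players of a factor in {eps, 1}."""
    w = 1.0
    for p in neighborhood:
        satisfied = is_best_response(p, x, strategies, neighborhood, utility)
        w *= 1.0 if satisfied else eps
    return w


def max_weight(eps, strategies, neighborhood, utility):
    """Maximum of p_eps over all configurations (exponential time)."""
    players = sorted(strategies)
    best = 0.0
    for combo in product(*(strategies[p] for p in players)):
        x = dict(zip(players, combo))
        best = max(best, p_eps(x, eps, strategies, neighborhood, utility))
    return best


# Two hypothetical 2-player games on the same graph.
strategies = {0: (0, 1), 1: (0, 1)}
neighborhood = {0: (0, 1), 1: (0, 1)}


def coordination(p, local):
    # Both players want to match: (0, 0) and (1, 1) are pure equilibria.
    return 1 if local[0] == local[1] else 0


def pennies(p, local):
    # Matching pennies: player 0 wants a match, player 1 a mismatch.
    match = local[0] == local[1]
    return 1 if (match if p == 0 else not match) else 0
```

With ε = 0.5, `max_weight` reaches 1 for the coordination game, which has pure Nash equilibria, and stays at ε for matching pennies, which has none.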

Proof of lemma 3.2 If $p_\epsilon(x) = 1$ then $x \in \arg\max_{x \in X} p_\epsilon(x)$, since $p_\epsilon(x) \le 1$ by definition ($\epsilon < 1$). Also:

$p_\epsilon(x) = 1$
$\Leftrightarrow \forall c \in C : \psi_c(x(c)) = 1$
$\Leftrightarrow \forall p \in V' : f_p(x(c_p)) = 1$
$\Leftrightarrow \forall p \in V' : x(c_p)_p \in BR_{u_p}(x(c_p)_{-p})$
$\Leftrightarrow x$ is a pure Nash equilibrium

Proof of lemma 3.3

Note that the marginalization of $p_0(x)$ with respect to a clique c of MRF(Q, 0) is simply a summation
$p_{0,c}(x(c)) = \sum_{x_v : v \notin c} p_0(x)$. Now it is easy to prove the claim as follows:

(⇒) If Q has a pure Nash equilibrium $x^*$ then $p_0(x^*) = 1$ (see proof of lemma 3.2). It follows that
$p_{0,c}(x^*(c)) > 0$ for every clique c of the MRF.
(⇐) Proof by contradiction:

$Q$ does not have a pure Nash equilibrium
$\Rightarrow \forall x, \exists p$ s.t. $f_p(x(c_p)) = \epsilon$
$\Rightarrow \forall x, \; p_0(x) = 0$ (since $\epsilon = 0$)
$\Rightarrow \forall$ clique $c$, $\forall x(c) : p_{0,c}(x(c)) = 0$

■

Proof of lemma 3.4
Proof by contradiction:

There is no pure Nash equilibrium $x^+$ of $Q$ such that $x^+(c) = x^*(c)$
$\Rightarrow \forall x$ with $x(c) = x^*(c) : p_0(x) = 0$
$\Rightarrow p_{0,c}(x^*(c)) = \sum_{x : x(c) = x^*(c)} p_0(x) = 0$
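
The role of the clique marginals for ε = 0 can be illustrated by computing them with the summation from the proof of lemma 3.3. The sketch below uses brute-force enumeration in place of the junction tree algorithm and is meant only as an illustration; the function names, the convention that each neighborhood contains the player itself, and the three-player path game are hypothetical.

```python
from itertools import product


def weight0(x, strategies, neighborhood, utility):
    """p_0(x): 1 if every player best-responds in x, else 0 (eps = 0)."""
    for p in neighborhood:
        local = {q: x[q] for q in neighborhood[p]}  # x restricted to N(p)
        if any(utility(p, {**local, p: s}) > utility(p, local)
               for s in strategies[p]):
            return 0
    return 1


def clique_marginal(clique, strategies, neighborhood, utility):
    """Sum p_0(x) over all configurations x that agree on the clique."""
    players = sorted(strategies)
    marginal = {}
    for combo in product(*(strategies[p] for p in players)):
        x = dict(zip(players, combo))
        key = tuple(x[q] for q in clique)
        marginal[key] = marginal.get(key, 0) + weight0(
            x, strategies, neighborhood, utility)
    return marginal


# Hypothetical coordination game on the path 0 - 1 - 2.
strategies = {0: (0, 1), 1: (0, 1), 2: (0, 1)}
neighborhood = {0: (0, 1), 1: (0, 1, 2), 2: (1, 2)}


def utility(p, local):
    # Number of neighbours playing the same strategy as p.
    return sum(1 for q, s in local.items() if q != p and s == local[p])


marginal_01 = clique_marginal((0, 1), strategies, neighborhood, utility)
```

In this game the only pure Nash equilibria are the two constant configurations, so the marginal on the clique {0, 1} is positive exactly on (0, 0) and (1, 1), the partial configurations that extend to an equilibrium.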



Proof of lemma 4.5 (sketch) Let $T = (C, \mathcal{E})$ be a tree decomposition of the game graph G = (V, E), where
$C \subseteq 2^V$, and suppose that $k = \max_{c \in C} |c| - 1$ is the width of the decomposition. We show how to construct
a tree decomposition T' of the primal graph of Q of width at most $(k + 1) \cdot \max_{p \in V} |\mathcal{N}(p)| - 1$. T' is
isomorphic to T; let σ denote the one-to-one correspondence between vertices of T and T'. Then for
all $c \in C$ we set $\sigma(c) = \bigcup_{p \in c} \mathcal{N}(p)$, i.e. every vertex of T' contains the union of the neighborhoods of all
players of the corresponding vertex of T. It is not difficult to see that T' is a tree decomposition of the primal
graph G' = (V, E') of the game. Indeed, for every edge (u, v) ∈ E' there is a vertex of T' that contains both
u and v: if (u, v) is an edge in G' then players u, v must belong to the neighborhood of some player p (maybe
p = u or p = v); but there is at least one vertex of T that contains p and, thus, the corresponding vertex of
T' must contain all of p's neighborhood and, so, u and v as well. Moreover, for every player p ∈ V the vertices
of T' that contain p form a connected subtree of T': since T is a tree decomposition of G, it is easy to see
that the vertices of T that contain player p or a neighboring player of p in G form a connected component
in T; but, in T', p appears in exactly those vertices whose corresponding vertex in T contains either p or a
neighbor of p in G, and, so, the nodes in which p appears must form a connected component in T'. Finally,
since $k + 1 = \max_{c \in C} |c|$, every vertex of T' contains at most $(k + 1) \cdot \max_{p \in V} |\mathcal{N}(p)|$ vertices. Given T,
the construction of T' can be done in polynomial time. ■
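
The construction in the sketch amounts to replacing every bag by the union of the neighborhoods of its players while keeping the tree edges unchanged. A minimal illustration follows, assuming bags and neighborhoods are given as Python sets and that each neighborhood contains the player itself; the function names and the four-player path game are hypothetical.

```python
def lift_decomposition(bags, neighborhood):
    """Map each bag c to the union of N(p) over p in c (tree edges unchanged)."""
    return {node: frozenset().union(*(neighborhood[p] for p in bag))
            for node, bag in bags.items()}


def width(bags):
    """Width of a tree decomposition: largest bag size minus one."""
    return max(len(bag) for bag in bags.values()) - 1


def primal_edges(neighborhood):
    """Pairs of players that appear together in some neighborhood."""
    return {frozenset((u, v))
            for members in neighborhood.values()
            for u in members for v in members if u != v}


# Hypothetical game graph: the path 0 - 1 - 2 - 3 with tree A - B - C.
bags = {"A": {0, 1}, "B": {1, 2}, "C": {2, 3}}
neighborhood = {0: {0, 1}, 1: {0, 1, 2}, 2: {1, 2, 3}, 3: {2, 3}}

lifted = lift_decomposition(bags, neighborhood)
bound = (width(bags) + 1) * max(len(m) for m in neighborhood.values()) - 1
```

Here the original decomposition has width 1, the lifted one has width 3, and the bound of the lemma evaluates to 5; every edge of the primal graph is contained in some lifted bag.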

Proof of theorem 4.6 For a fixed constant k the algorithm performs the following steps on input
$Q = (G = (V, E), \{S_p\}, \{u_p\})$:

1. Check whether G has treewidth bounded by k and, if so, find a tree decomposition of G of width at
most k. For details on how to perform this step in polynomial time see for example [ACP87], [Klo94].

2. From the tree decomposition of G get a clique tree T' of the primal graph of Q that has width at most
$(k + 1) \cdot \max_{p \in V} |\mathcal{N}(p)|$ using lemma 4.5.

3. Reduce Q to MRF(Q, 0) as described in section 3. The graph of the MRF is the primal graph of the
game and so T' is a clique tree for the graph of MRF(Q, 0).

4. Run the Junction Tree Algorithm on T' to compute the marginal probability distributions of the
significant cliques of MRF(Q, 0). The marginal probability distributions answer the question of whether
Q has a pure Nash equilibrium and also constitute a succinct description of all pure Nash equilibria of
Q.

The correctness of the algorithm follows easily from the correctness of every intermediate step. The running
time of the algorithm is polynomial. This is easily proven using the same rationale as in the proof of theorem
4.4. However, in this case the cliques contained in the nodes of the clique tree have size that is at most
$(k + 1) \cdot \max_{p \in V} |\mathcal{N}(p)|$, i.e. at most k + 1 times the size of the biggest neighborhood of the game. Since
the biggest table of the input has dimension $\max_{p \in V} |\mathcal{N}(p)|$ and the biggest table in the clique tree has
dimension k + 1 times $\max_{p \in V} |\mathcal{N}(p)|$, where k is fixed, it follows that the tables of the clique tree have
size polynomial in the input complexity. Thus, the algorithm runs in polynomial time. ■

Proof of theorem 4.7 The proof is based on the following graph-theoretic lemma.

Lemma B.1 If a graphical game $Q = (G = (V, E), \{S_p\}, \{u_p\})$ has hypertree width bounded by k then its
primal graph has treewidth bounded by $k \cdot \max_{p \in V} |\mathcal{N}(p)| - 1$. Moreover, given a hypertree decomposition
$(T, \chi, \lambda)$ of $\mathcal{H}(Q)$ we can compute in polynomial time a clique tree for the primal graph of Q of width
$k \cdot \max_{p \in V} |\mathcal{N}(p)|$, where k is the width of $(T, \chi, \lambda)$.

Proof: Given a hypertree decomposition $(T, \chi, \lambda)$ of $\mathcal{H}(Q)$, take T' to be a tree isomorphic to T after
removing directions from the edges and let σ denote the one-to-one correspondence between vertices of T
and T'. Then for all $v \in \mathrm{vert}(T)$ we set $\sigma(v) = \chi(v)$. It is easy to see that T' is a clique tree for the
primal graph of Q, using properties 1 and 2 of the hypertree decomposition (see definition A.1). Moreover,
from property 3 it follows that for every node v of T: $\chi(v) \subseteq \bigcup_{h \in \lambda(v)} h$. Thus, for every node v of T:
$|\chi(v)| \le k \cdot \max_{p \in V} |\mathcal{N}(p)|$, where k is the width of the hypertree decomposition $(T, \chi, \lambda)$. Thus, the
clique tree T' has width at most $k \cdot \max_{p \in V} |\mathcal{N}(p)|$. The construction can obviously be done in polynomial
time. ■
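
The size bound in the proof can be written out as a chain of inequalities. The sketch below assumes, as the statement of the lemma suggests, that every hyperedge of $\mathcal{H}(Q)$ has at most $\max_{p \in V} |\mathcal{N}(p)|$ vertices; that assumption is ours and is not restated in this appendix.

```latex
|\chi(v)| \;\le\; \Bigl|\bigcup_{h \in \lambda(v)} h\Bigr|
          \;\le\; \sum_{h \in \lambda(v)} |h|
          \;\le\; |\lambda(v)| \cdot \max_{p \in V} |\mathcal{N}(p)|
          \;\le\; k \cdot \max_{p \in V} |\mathcal{N}(p)| .
```

The first step is property 3 of definition A.1, and the last step uses that the width k bounds $|\lambda(v)|$ for every vertex v.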

For a fixed constant k the algorithm performs the following steps on input $Q = (G = (V, E), \{S_p\}, \{u_p\})$:

1. Check whether $\mathcal{H}(Q)$ has hypertree width bounded by k; if so, find a hypertree decomposition $(T, \chi, \lambda)$
of $\mathcal{H}(Q)$ of width at most k. For details on how to perform this step in polynomial time see [GLS02].

2. From the hypertree decomposition $(T, \chi, \lambda)$ get a clique tree T' of the primal graph of Q that has
width at most $k \cdot \max_{p \in V} |\mathcal{N}(p)|$ using lemma B.1.

3. Run the algorithm of theorem 4.6 from step 3 onward.

The correctness and the time complexity of the algorithm are analyzed in the same way as in the proof of
theorem 4.6. ■

Proof of theorem 4.9 The algorithm is similar in spirit to the one presented in the proof of theorem 4.8.
Again, we use the modified algorithm of Becker and Geiger, as described in the proof of theorem 4.8, for
k = c log n, where c is a fixed constant, but we apply it directly to the graph G of the MRF. The algorithm
runs in polynomial time and, if G has treewidth bounded by c log n, it returns a tree decomposition of G of
width at most 3.67c log n. So the biggest table in the clique tree will be of dimension at most 3.67c log n + 1.
Thus, assuming bounded cardinality strategy sets, the sizes of the tables (potential functions) that we store at
the nodes of the clique tree are polynomial in the number of players, so all the computation takes polynomial
time. ■